Spatially Prioritized and Persistent Text Detection and Decoding
نویسندگان
چکیده
We show how to exploit temporal and spatial coherence to achieve efficient and effective text detection and decoding for a sensor suite moving through an environment in which text occurs at a variety of locations, scales and orientations with respect to the observer. Our method uses simultaneous localization and mapping (SLAM) to extract planar “tiles” representing scene surfaces. It then fuses multiple observations of each tile, captured from different observer poses, using homography transformations. Text is detected using Discrete Cosine Transform (DCT) and Maximally Stable Extremal Regions (MSER) methods; MSER enables fusion of multiple observations of blurry text regions in a component tree. The observations from SLAM and MSER are then decoded by an Optical Character Recognition (OCR) engine. The decoded characters are then clustered into character blocks to obtain an MLE word configuration. This paper’s contributions include: 1) spatiotemporal fusion of tile observations via SLAM, prior to inspection, thereby improving the quality of the input data; and 2) combination of multiple noisy text observations into a single higher-confidence estimate of environmental text. Keywords—SLAM, Text Detection, Video OCR, Multiple Frame Integration, DCT, MSER, Lexicon, Language Model
منابع مشابه
Prioritized Degree Distribution in Wireless Sensor Networks with a Network Coded Data Collection Method
The reliability of wireless sensor networks (WSNs) can be greatly affected by failures of sensor nodes due to energy exhaustion or the influence of brutal external environment conditions. Such failures seriously affect the data persistence and collection efficiency. Strategies based on network coding technology for WSNs such as LTCDS can improve the data persistence without mass redundancy. How...
متن کاملPrioritized Information Recovery for Wireless Link-Layer Communication
In this paper we develop Prioritized Automatic Code Embedding (PACE) link-layer protocol to achieve preferred data recovery order across connections, while maintaining stable and reliable data flows over wireless networks. We classify link-layer traffic arrivals into different priorities based on the packet delay constraint and the distortion associated with the loss of that packet. The traffic...
متن کاملOn Detecting Spatially Similar and Dissimilar Objects Using Adaboost
AdaBoost has been verified to be proficient in processing images rapidly while attaining high detection rate in face detection. The speed of AdaBoost in face detection is demonstrated in [1], where the detection can be performed in 15 frames per second. The robust speediness and the high accuracy in tracing the target objects have enable AdaBoost to be successful in classification problems. In ...
متن کاملReading the Hidden Concepts in the Text of Tehran Highways Walls
Because of the dispersion of activities in big cities, employing highways as the most important connecting way is indispensable. In today world, highways are connection arteries of cities transporting people to their social activities. Because of continual use of highway, their walls prepare a context for conveying many Social, Cultural, Economic, etc concepts. That is why most economic promoti...
متن کاملDocument Image Dewarping Based on Text Line Detection and Surface Modeling (RESEARCH NOTE)
Document images produced by scanner or digital camera, usually suffer from geometric and photometric distortions. Both of them deteriorate the performance of OCR systems. In this paper, we present a novel method to compensate for undesirable geometric distortions aiming to improve OCR results. Our methodology is based on finding text lines by dynamic local connectivity map and then applying a l...
متن کامل